High Performance Training of Feedforward & Simple Recurrent Networks
Abstract
TRAINREC is a system for training feedforward and recurrent neural networks that incorporates several ideas. It uses the conjugate-gradient method, which is demonstrably more efficient than traditional backward error propagation. We assume epoch-based training and derive a new error function having several desirable properties absent from the traditional sum-of-squared-error function. We argue for skip (shortcut) connections where appropriate and for a bipolar sigmoid yielding values over the [-1, 1] interval. The input feature space is often over-analyzed, but by using singular value decomposition, input patterns can be conditioned for better learning, often with a reduced number of input units. Recurrent networks, in their most general form, require special handling and cannot be simply a rewiring of the architecture without a corresponding revision of the derivative calculations. A careful balance is required among the network architecture (specifically, hidden and feedback units), the amount of training applied, and the ability of the network to generalize; these issues often hinge on selecting the proper stopping criterion. Discovering methods that work in theory as well as in practice is difficult, and we have spent substantial effort evaluating and testing these ideas on real problems to determine their value. This paper encapsulates a number of such ideas, ranging from those motivated by a desire for efficiency of training to those motivated by correctness and accuracy of the result. While this paper is intended to be self-contained, several references are provided to other work upon which many of our claims are based.

The popularity of neural networks continues to increase, particularly as researchers have realized the importance of recurrent networks.
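The SVD-based input conditioning mentioned above can be sketched as follows. This is a minimal illustration, not TRAINREC's implementation: all names, the synthetic low-rank data, and the 95% variance threshold are assumptions made for the example.

```python
import numpy as np

# Illustrative sketch: condition input patterns with SVD, keeping only
# the leading singular directions. The data and threshold are assumed.
rng = np.random.default_rng(0)
latent = rng.standard_normal((100, 5))       # 100 patterns, 5 true factors
mix = rng.standard_normal((5, 20))           # spread factors over 20 raw features
X = latent @ mix + 0.01 * rng.standard_normal((100, 20))
X -= X.mean(axis=0)                          # centre features before SVD

U, s, Vt = np.linalg.svd(X, full_matrices=False)

# Keep enough components to explain ~95% of the variance.
energy = np.cumsum(s**2) / np.sum(s**2)
k = int(np.searchsorted(energy, 0.95)) + 1

X_cond = X @ Vt[:k].T                        # fewer, decorrelated input units
print(X_cond.shape)                          # (100, k) with k well below 20
```

Because the projected columns are scaled left singular vectors, the conditioned inputs are mutually decorrelated, which is what makes the reduced representation friendlier to gradient-based training.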
Recurrent architectures have the advantage of being able to remember, to some degree, what inputs or events occurred earlier in a sequence, and can exert control over future decision-making based on these past events. For example, Servan-Schreiber et al. [26] discuss the use of recurrent networks as representations of state machines. The challenges of recurrent networks are similar to, but not precisely the same as, those for ordinary feedforward networks. As a field, neural networks are graduating from the novelty and toy-problem stage and becoming an important and useful tool for a wide variety of problem areas. As our understanding of the methodology increases through research, we become better equipped to address tasks on the scale of practical, real-world problems. Likewise, it seems, as our ability to train …
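The memory property described above can be illustrated with a minimal Elman-style simple recurrent network forward pass. This is a sketch under stated assumptions, not the paper's architecture: the function name, weight shapes, and sizes are invented for the example; tanh stands in for the bipolar sigmoid over [-1, 1].

```python
import numpy as np

def srn_forward(xs, W_in, W_rec, W_out):
    """Run a sequence through a simple recurrent network (illustrative).

    The hidden state h carries information from earlier inputs forward,
    which is what lets such a network behave like a state machine.
    """
    h = np.zeros(W_rec.shape[0])
    outputs = []
    for x in xs:
        # bipolar sigmoid (tanh) keeps activations in [-1, 1]
        h = np.tanh(W_in @ x + W_rec @ h)
        outputs.append(np.tanh(W_out @ h))
    return np.array(outputs)

# Hypothetical sizes: 3 inputs, 5 hidden units, 2 outputs, 4 time steps.
rng = np.random.default_rng(1)
n_in, n_hid, n_out, T = 3, 5, 2, 4
xs = rng.standard_normal((T, n_in))
ys = srn_forward(xs,
                 rng.standard_normal((n_hid, n_in)) * 0.5,
                 rng.standard_normal((n_hid, n_hid)) * 0.5,
                 rng.standard_normal((n_out, n_hid)) * 0.5)
print(ys.shape)  # (4, 2): one output vector per time step
```

Note that, as the abstract warns, training such a network is not just a rewiring of a feedforward network: the derivative of each output with respect to the weights must account for the hidden state's dependence on all earlier time steps.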
Publication date: 1994